Accepted by the Astrophysical Journal 

Preprint typeset using 1^1^^ style cmulatcapj v. ^jllj^A, 



NOAO FUNDAMENTAL PLANE SURVEY — IL AGE AND METALLICITY ALONG THE RED SEQUENCE 

FROM LINESTRENGTH DATA 

Jenica E. Nelan 

Department of Physics and Astronomy, Dartmouth College, Hanover, NH, USA 

Russell J. Smith, Michael J. Hudson 

Department of Physics, University of Waterloo, Waterloo, Ontario, Canada N2L 3G1 



in 
o 
o 

(N 



Gary A. Wegner 

Department of Physics and Astronomy, Dartmouth College, Hanover, NH, USA 



John R. Lucey, Stephen A. W. Moore, Stephen J. Quinney 

Department of Physics, University of Durham, South Road, Durham DHl 3LE, United Kingdom 



o 

m 

o 
in 
o 

o 



Nicholas B. Suntzeff 

Cerro Tololo Inter- American Observatory, Casilla 603, La Serena, Chile 
Accented by the Astrophysical Journal 

ABSTRACT 

We present spectroscopic linestrength data for 4097 red-sequence galaxies in 93 low-rcdshift galaxy 
clusters, and use these to investigate variations in average stellar populations as a function of galaxy 
mass. Our analysis includes an improved treatment of nebular emission contamination, which affects 
~ 10% of the sample galaxies. Using the stellar population models of D. Thomas and collaborators, 
we simultaneously fit twelve observed linestrength— ct relations in terms of common underlying trends 
of age, [Z/H] (total metallicity) and [a/Fe] (a-element enhancement). We find that the observed 
linestrength-cr relations can be explained only if higher-mass red-sequence galaxies are, on average, 
older, more metal rich, and more a-enhanced than lower-mass galaxies. Quantitatively, the scaling 
relations are age oc a°-^^'^°-^^, Z/H ex (t°-^3=^° °^ and a/Fe cx (t°-3i=^°-°^, where the errors reflect 
the range obtained using different subsets of indices. Our conclusions are not strongly dependent 
on which Balmer lines are used as age indicators. The derived age— ct relation is such that if the 
largest (ct ~ 400km s""'^) galaxies formed their stars ~ 13Gyr ago, then the mean age of low-mass 
(ct ^ 50km s^^) objects is only ~4Gyr. The data also suggest a large spread in age at the low-mass 
end of the red sequence, with 68% of the galaxies having ages between 2 and 8 Gyr. We conclude 
that although the stars in giant red galaxies in clusters formed early, most of the galaxies at the faint 
end joined the red sequence only at recent epochs. This "down-sizing" trend is in good qualitative 
agreement with observations of the red sequence at higher redshifts, but is not predicted by semi- 
analytic models of galaxy formation. 

Subject headings: surveys — galaxies: clusters: general — galaxies: elliptical and lenticular, cD — 
galaxies: evolution 



1. INTRODUCTION 

Although early-type galaxies contain the bulk of the 
stellar mass in the low redshift Universe, their formation 
histories remain poorly understood. Early-type galaxies 
lie on a tight "red sequence" in the color- magnitude dia,- 
gram fea ndaee fc Visvanathan .1978; Bower et alJll992|) 
and follow well-known dyna mical scaling re l ations , 
such as the Faber- Jackson ijFaber fc JacksonI Il97fif) . 
the Dn — CT (Dressie r! Il987^. and the Fundam ental 
Plane (FP) relations (|Diorgovski fc Davi^ Il987|) . In 
addition to these scaling relations, elliptical galaxies 
also exhibit systematic correlations between spectro- 
scopic absorption linestrengths and velocity dispersion, 
CT. Especially well studied are the linestrength cen- 
tered on the magn esium tripl et of absorption lines 
near A 5175 with ct llBender et al ,1993; Wegner et al.l 
119991 iKuntschner et alJ l200lt iBernardi et all booa 



iDavies et al.lll98'7|) . as well as iron lines and the Balmer 
series (e.g., Mehlert et al. 2003). 

The tightness of these scaling relations have gener- 
ally been interpreted as evidence for coeval formation 
of early- type galaxies (Bower et al. 1992), with the 
slope of the color-magnitude relation arising from a mass- 
metallicity sequence. However, attempts to fit their spec- 
tra with stellar population models suggest that the for- 
mation of early-types may be more co mplicated, and 
that there may be a spread in their ages ( Y^ort he vljl^^^ 
[Thomas et alll2003t iTrager et al.ll2000al ICaldwcU etal) 
12003). Recent studies of early-type galaxies at high red- 
shift indicate that the scatter and zero-point of the FP 
relation varies with redshift ( Wuvts ct al. 2004) , and may 
indicate that stellar populati on e ffects and age both play 
roles in determining the FP l)van Dokkum & Ellis 2003) . 

A primary goal of the NOAO Fundamental Plane Sur- 
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vey (NFPS) is to address these questions by using early- 
type galaxies in nearby clusters {z < 0.07). The NFPS 
dataset consists of photometry and spectroscopy for 5479 
galaxies belonging to 93 clusters, and is ideal for study- 
ing the evolution and properties of early-type galaxies 
in the cluster environment. The galaxies that have been 
selected for spectroscopic analysis are all cluster "red- 
sequence" galaxies. Therefore they are mostly elliptical 
or SO galaxies, but there is no explicit selection by mor- 
phology. 

The NFPS data are to be presented in a series of 
papers. Paper I (Smith et al. 2004; hereafter NFPS- 
I) described the goals and selection of the survey and 
contains redshifts and velocity dispersions. The pur- 
pose of this paper, the second in the series, is to cor- 
relate the linestrengths extracted from our high quality 
spectroscopic data with our velocity dispersions and use 
the resulting relations to investigate the broad trends 
of stellar age and metallicity along the mass sequence. 
Section 13 presents the measurements of the absorption 
linestrengths and their errors and in Section |21 we de- 
tail our new method of measuring emission lines in the 
H/3 linestrength. In Section 01 we describe how we chose 
which galaxies to include in our final sample. Section |31 
shows the derived linestrength-(T relations for many of 
the linestrengths, and in Section ^ we use stellar pop- 
ulation models to derive global age, metallicity and a- 
element enhancement, [a/Fe], trends as a function of ve- 
locity dispersion. We compare our trends to those from 
other studies and to results from intermediate-redshift 
observations in Section [7| 

2. ABSORPTION-LINE MEASUREMENTS 
2.1. Spectroscopic Data 

NFPS-I l|Smith et alJl2004|) contains detailed descrip- 
tions of the selection of the data for spectroscopic 
follow-up along with the observations and measurements 
of redshifts and velocity dispersions. To summarize, 
NFPS clusters were X-ray selected from the XBACS 
(jEbehng et al.llT99^ and BCS l|Ebeling et al.l ll998) cat- 
alogs for imaging. Individual galaxies were selected for 
follow-up spectroscopic observing based on their posi- 
tion in the cluster color-magnitude diagram relative to 
the red-sequence ridge-line. Specifically, galaxies with 
i? < 17 and A{B — R) > 0.2 were chosen for spectro- 
scopic observation. As noted above, there are no explicit 
morphological selection criteria. 

NFPS spectroscopic observations were carried out us- 
ing the Hydra multi-fiber spectrographs at the CTIO 
Blanco 4m telescope and the 3.5m WIYN telescope at 
Kitt Peak. Approximately 50-70 galaxies were observed 
in each cluster. Spectral resolution for both CTIO and 
WIYN is 3 A with CTIO data sampled at 1.15 A pixel" ^ 
and WIYN data sampled at 1.4 A pixel" ^. The median 
S/N for both WIYN and CTIO data is 22. Variance- 
weighted extraction of the spectra, along with cosmic 
ray rejection, wavelength calibration and sky subtrac- 
tion were performed using tasks in the HYDRA package 
in IRAF. 

2.2. Linestrength measurements 

The linestrength indices used in t his paper are 
from the original Lick/IDS system (|Burstein et al.l 



TABLE 1 
NFPS Ho Line Indices 



Line Index 


Bandpass (A) 


Pseudocontinua (A) 


HoF 


6554.000-6568.000 


6515.000-6540.000 






6568.000-6575.000 


HoA 


6554.000-6575.000 


6515.000-6540.000 






6575.000-6585.000 



119841) and its extensi ons to high-order B ai mer lines 
iWorthev fc Ottavianil 119971. including H/3+ ijGonzalezI 
11993*). Note that the US a and HSp linestrength from 
Worthcv & Ottaviani (1997) differ shghtly from the US 
hnestrength used in, e.g.. IBalogh et alJ (|1999l) : the H^a 
central bandpass is almost identical to their US, but the 
red and blue continua differ by ~ 10 A on eac h side. RSf is 
a much narrower linestrength than that of IBalogh et alJ 
(1999). 

Additionally, we defined and measured two new 
linestrengths around the Ha line, which falls within 
the range of the WIYN spectra, for ~700 galaxies with 
cz <^ 13000 km s~^. The two linestrengths, summarized 
in Table ^ differ in their sensitivity to contamination 
from neighboring [Nil] emission lines. Hap has a nar- 
row definition, with a narrow red continuum bandpass 
between the [Nil] 6583 A emission line and Ha. This is 
primarily designed to detect Ha emission where present. 
HaA has a wider red continuum, which is contaminated 
by Nil emission when present. In the absence of emis- 
sion, however, HaA should be the more reliable indicator 
of stellar Ha absorption. 

The molecular TiOi and Ti02 linestrengths arc only 
measured in ~1000 of our galaxies, most of which are 
from our northern clusters since the wavelength range 
of the WIYN spectrograph extends further into the red 
than that at CTIO. Similarly, relatively few galaxies have 
Ha measurements for the same reason. 

Prior to measuring linestrengths, the galaxy spectra 
were corrected to an approximate relative fiux scale 
by comparison with a model elliptical template from 
iKinnev et alJ (11996). However, because we did not ob- 
serve Lick calibration stars, we have not attempted to 
correct our galaxy spectra in order to directly match 
to the Lick fiux system. Because each linestrength is 
defined using a pair of pseudocontinua bracketing the 
line-feature, the linestrength indices are robust against 
differences in the Lick and NFPS response functions un- 
less there is substantial relative curvature over the ex- 
tent of a linestrength. The most susceptible definitions 
are those with the widest extent, viz. the "molecular" 
hnestrengths Mgi_2, TiOi,2, and CNi^2- Caution should 
be used when analyzing these linestrengths, especially if 
the sample covers a substantial range in redshift. Note 
also that the NFPS spectra are unsuitable for the mea- 
surement of discontinuity linestrengths such as D4000 
(Bruzual 1993), since these are very sensitive to flux- 
calibration. Moreover, our instrumental setup, especially 
at WIYN, has very poor response at <4000A. 

To measure linestrengths, we chose to use the pro- 
gram INDEXF (Cardiel, Gorgas and Cenarro; see Ce- 
narro et al. 2001) because of its careful calculation of 
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linestrength errors, following ICardiel et alJ iITqqI . The 
estimated error spectrum (incorporating Poisson-errors 
and a contribution from sky-subtraction noise) yields a 
"photometric" error, which is added in quadrature with 
the (Monte-Carlo derived) errors due to the uncertainty 
in radial velocity, yielding a total error due to photon 
noise, Cphot- 

The linestrengths are measured both at the ~3 A na- 
tive resolution of the NFPS spectra and also at the ~9 A 
resolution of the Lick system. The galaxy spectra were 
broadened to the Lick resolution fo r each linestrength fol- 
lowing the resolution curve from iWorthev fc Ottavianil 
Ijl997|l . For the NFPS Ha hnestrengths that he red- 
ward of the Lick spectral range, we smoothed by 10.5 
A FWHM. The Lick-resolution linestrengths are suit- 
able for c omparison to sy nthesis models such as those of 
Worthey l|Worthevlll994D . based on low- resolution stellar 
libraries. The full-resolution linestrengths in principle 
retain more detailed information (at least for galaxies 
with small velocity broadening), and can be analyzed in 
compariso n to higher-resol ution synthesis models, such 
as those of lVazdekisI ()1999|) . 

2.3. Velocity broadening and aperture corrections 

Galaxy spectra are broadened by the line-of-sight ve- 
locities of their stars, generally resulting in a dilution 
in measured linestrength with increasing velocity disper- 
sion, a. We followed the standard procedure of determin- 
ing correction curves from artificially broadened K-giant 
stellar spectra. The correction curve for the WIYN data 
is the average of the curves from nine stars while the 
CTIO correction curve is the average of the curves from 
twelve stars. These corrections are very stable and there- 
fore we applied a common correction curve to data from 
all of the observing runs. For our Lick-resolution galaxy 
spectra, the stars were first broadened to the Lick reso- 
lution before deriving the correction curves. Our correc- 
tions are very similar to those obtained by other groups 
(e.g., Poggianti et al. 2001). The velocity broadening 
corrections derived from different template stars agree 
within ^10% for all indices except H/3. However, in ab- 
solute terms the velocity broadening correction for H/3 is 
quite small (^ 20% of the index value). 

The NFPS spectra sample the galaxy light within a 
fixed 2 arcsec diameter. This angular scale samples differ- 
ent physical scales for galaxies at different angular diam- 
eter distances. Because galaxies have internal gradients, 
it is necessary to correct the raw data for such aperture 
effects. In NFPS-I, we correcte d the velocity dispersion s 
according to the prescription of lJorgensen et alTl)1995|) . 

Aperture corrections can also be applied to 
linestrengths to correct for increasing apparent galaxy 
size at higher redshift. We followed the formula 

-^cor — -^ap "t" ^ap (1) 

where /ap is the uncorrected linestrength value measured 
through a 2 arcsec aperture, and Aap is given by 

Aap = log . (2) 

The linestrength gradient k = (ilog(index)/(ilog(rap), 
and ^ is a ratio of angular diameter distances. Here do 
is a normalization factor defined as the angular diameter 



TABLE 2 
Aperture Corrections 



Line Index 


Units 


K 


(Aap) 


(A 


2 \i 
^ ) 2 
ap/ 


H<5a 


A 


1.283 


-0.071 


0. 


,172 


H(5p 


A 


0.351 


-0.019 


0. 


,047 


CNi 


map" 


-0.058 


0.003 


0, 


,008 






-0.121 


0.007 


0. 


016 


H'Va 




1.349 


-0.075 


Q 


181 


H'Yp' 




0.629 


-0.035 


Q 


084 






-1.484 


0.082 


0. 


199 


Kya'-t'-tOO 


A 


-0.226 


0.013 


0. 


030 


Fe4531 


A 


-0.544 


0.030 


0. 


,073 




A 


-2.580 


0.143 


0. 


,346 




X 

I\ 


u.uuu 


n nnn 
u.uuu 


u. 


nnn 
uuu 


reoUio 


A 


-1.346 


0.075 


0. 


,180 


Mgi 


mag 


n (\A(\ 
-U.U4U 




n 

u. 






mag 


-0.066 


005 


Q 


Oil 


Mg6 


A 


-1.375 


0.080 


0, 


,182 


Fe5270 


A 


-0.706 


0.091 


0. 


095 


Fe5335 


A 


-0.706 


0.049 


0, 


,118 


Fe5406 


A 


-0.460 


0.014 


0. 


,062 


Fe5709 


A 


-0.019 


0.001 


0. 


,003 


Fe5782 


A 


-0.019 


0.001 


0, 


,002 


Na5895 


A 


-0.045 


0.006 


0, 


,006 


TiOi 


mag 


-0.016 


0.002 


0. 


,003 


Ti02 


mag 


-0.011 


0.001 


0. 


,001 


HaA 


A 


0.035 


-0.008 


0, 


,003 


H«F 


A 


0.035 


-0.008 


0, 


,004 



distance at z = 0.05. For each galaxy, d is the angular di- 
ameter distance corresponding to the cluster CMB-frame 
redshift; for non-cluster galaxies, d is a function of the 
CMB-frame redshift of the galaxy itself. In calculating 
d, we adopted a cosmology with f7„i — 0.3 and Q\ = 0.7. 
Thus the physical diameter of the corrected aperture di- 
ameter is 1.37 h~^kpc. Note that this correction is to 
a fixed metric diameter and not to a multiple of the ef- 
fective radius. Re- The gradients k were compiled from 
other groups who either measured internal linestrength 
gradients at varying radii or extracted multiple apertures 
for a galaxy, and are summarized in Table |3I In partic- 
ular, gr a dients either from iKuntschner et alJ l|2002r) or 
iProctod l|200l were used for most of our linestrengths. 
Table El also lists the mean difference between the cor- 
rected and uncorrected linestrengths for each line index, 
and the dispersion in this difference. The difference be- 
tween the corrected and uncorrected linestrengths varies 
by galaxy, but is generally at the level of a few percent. 

The aperture corrections in velocity dispersion and 
linestrengths were not applied for purposes of compar- 
isons in Sections 12.41 and 12.51 However in Section [3 
and subsequent sections, the velocity dispersions and 
linestrengths are corrected for aperture effects. 

2.4. Inter-run comparisons 

As described above, we chose to measure our 
linestrengths with INDEXF largely because of its method 
of calculating errors, which allows for noise in the spectra 
as well as errors in the radial velocities. However, given 
the possibility of systematic errors, it is useful to check 
the consistency of the error estimates by comparing re- 
peat observations of the same galaxies. 
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Within the NFPS survey several "standard" equato- 
rial clusters were re-observed on several runs, and from 
each telescope. The primary goal of this was to ensure 
that velocity dispersions were on a consistent base system 
and to correct discrepant runs if necessary. In NFPS-I, 
we showed that velocity dispersions could be calibrated 
to a systematic accuracy of ~0.004 dex (thus limiting 
systematic errors in FP-derived distances to <1.5%). 

As an example of the consistency of our linestrength 
measurements, we compare the Mg6 instrumental- 
resolution absorption line measurements in galaxies that 
were observed in multiple runs at the same telescope and 
also at both sites. In Figure ^ we show comparisons of 
galaxies observed in more than one CTIO or WIYN run. 

While Poisson errors are the dominant source of error 
in the linestrength index measurements, results for the 
same galaxy may also differ due to other effects such 
as fiber position errors and variations in fiber trans- 
mission. Such errors will affect individual measure- 
ments in a way that is likely to be independent of the 
signal-to-noise ratio. To allow for these effects, we in- 
clude an "external error," efib, assumed to be constant 
for all measurements (but of course different from one 
linestrength to another), added in quadrature with the 
estimated measurement errors, so that the total random 
error Cran = (cphot^ + efib^)"^^^- We determine the value 
of this external error by requiring a good for repeated 
measurements of the galaxies in common. We find that 
for most indices, the external errors are typically much 
smaller than the typical random errors. Nevertheless, it 
is necessary to account for them since accurate character- 
ization of measurement errors will be important in Sec- 
tion when we attempt to determine the intrinsic pop- 
ulation variation in linestrength. For six indices (Mgi^2, 
TiOi.2, HofA, Hap), the external errors are slightly larger 
than the typical random error. We do not use any of 
these six indices to determine stellar population param- 
eters. 

There is also the possibility of small systematic off- 
sets in the linestrength measurements between runs, due, 
for example, to variations in seeing or to different in- 
strumental configurations. To calculate additive correc- 
tions, Arun, needed to bring the linestrength measure- 
ments from different runs onto a common system, we 
use the repeat measurements and follow the same pro- 
cedure as was used for velocity dispersions in NFPS-I. 
We find in that in most cases (with the exception of the 
Ha indices) the run corrections are smaller than the typ- 
ical random errors and are very much smaller than the 
range spanned by the linestrength data. Nevertheless, 
we prefer to include these small corrections so as not 
to bias future bulk flow measurements derived from the 
FP relation which also incorporate these linestrengths as 
additional parameters. These corrections are applied to 
the raw linestrengths before the data are merged to yield 
a final set of linestrength data for each galaxy. In Ta- 
ble 13 for each linestrength we list the total dispersion in 
the measured linestrengths for a given index, the typical 
photon error, Cphot, the systematic error, egb, the total 
random error, Eran, and the typical (root-mean-square) 
run correction {A'^^^)^. 

2.5. Comparisons with other surveys 



TABLE 3 
Linestrength Index Errors 



Line Index 


/ r9 \ i a 
(I ) 2 


b 

^phot 






1 \ 9 \ i 

( ) 2 ^ 
\^run/ 




2.14 


0.91 


0.58 


0.99 


0.30 


H6f 


1.24 


0.61 


0.33 


0.64 


0.13 


CNi 


0.06 


0.02 


0.02 


0.03 


0.01 


CN2 


0.07 


0.03 


0.02 


0.03 


0.02 




0.60 


0.40 


0.20 


0.42 


0.06 


G4300 


1.08 


0.61 


0.36 


0.66 


0.28 


H'YA 


1.51 


0.64 


0.45 


0.73 


0.32 




0.88 


0.39 


0.20 


0.41 


0.08 


J. C'rtOOO 


1.15 


0.75 


0.37 


0.78 


0.26 




0.53 


0.43 


0.00 


0.40 


0.08 


Fe4531 


0.77 


0.53 


0.25 


0.55 


09 


J. CrrVJUO 


1.51 


0.68 


0.50 


0.80 


0.33 


np 


0.60 


0.27 


0.11 


0.28 


0.05 




0.52 


0.18 


0.09 


0.20 


0.02 


J- dJ\J -LiJ 


1.04 


0.59 


0.33 


0.64 


0.14 


iVigl 


0.03 


0.01 


0.01 


0.01 


0.01 


Mg2 


0.04 


0.01 


0.01 


0.01 


0.01 




0.66 


0.26 


0.12 


0.27 


0.03 


Fe5270 


0.44 


0.28 


0.11 


0.28 


0.04 


Fe5335 


0.50 


0.34 


0.11 


0.34 


0.05 


Fe5406 


0.35 


0.25 


0.11 


0.26 


0.04 


Fe5709 


0.27 


0.20 


0.10 


0.21 


0.02 


Fe5782 


0.27 


0.17 


0.10 


0.19 


0.05 


Na5895 


0.95 


0.18 


0.16 


0.23 


0.08 


TiOi 


0.01 


0.00 


0.01 


0.01 


0.00 


Ti02 


0.01 


0.00 


0.01 


0.01 


0.00 


HaA 


1.23 


0.15 


0.15 


0.21 


0.31 


HaF 


2.67 


0.14 


0.19 


0.24 


0.09 


''The total 


dispersion 


over the 


NFPS 


in the measured in- 



dex, arising from intrinsic population differences and mea- 
surement error. 

''Median random error. 

"^Mean systematic error. 

''Median total random error in merged sample. 
"Root-mean-square correction over all runs. 



We compare our linestrengths with data from two 
other surveys: ~140 galaxies overlapping with the sec- 
ond data release fro m the Sloan Digital Sky Survey 
|Abazaiiai^^et^PP04) and 33 Coma Cluster galaxies 
from lMoore et a l. ( 2002) . The comparison with Moore et 
al. are at the Lick resolution. SDSS linestrengths avail- 
able in their archive are at their instrumental resolution 
(2.4 A) while our instrumental resolution linestrengths 
are at 3 A. Using the ratio of our Lick resolution 
linestrengths to our full resolution linestrengths, we 
"scaled" the SDSS linestrengths to 3 A resolution. The 
correction to each linestrength is typically very small 
(~2%). None of the data sets have aperture corrections 
applied for these comparisons. 

In Table 0] we compare our measurements with those 
from other surveys, for several linestrengths in common. 
The mean offsets are defined as A =< /nfps - Mother > 
where /nfps is the NFPS linestrength and /other is the 
SDSS or Moore et al. linestrength, and the rms is the 
standard deviation of the differences. 

We plot the NFPS linestrengths against those of Moore 
et al. in Figure 12 Our data are in very good agree- 
ment with those of Moore et al.; with the exception of 
Mgi, which, as noted above, is susceptible to flux cali- 
bration error, there is no evidence for systematic offsets. 
The values of the comparisons are acceptable for all 
hnestrengths except for Fe4668, Fe5270 and Mg2. This 
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Fig. 1. — Overlapping observations between CTIO runs and WIYN runs. The line in each plot has a slope of unity, representing where 
galaxies should fall if there is no offset between multiple observations. Crosses indicate galaxies more than Aa away from the mean weighted 
least squares fit between the two axes. Average errors are shown in the upper left of the plot and the number of galaxies is displayed in 
the lower right. Mg6 is in units of angstroms and measured at full (instrumental) resolution. 



TABLE 4 
Line Index Offsets 



Survey Line Index 


Units 




Mean offset 


rms 


Moore et al Fe4668 


A 


34 


0.236±0.174 


0.694 


H/3 


A 


34 


-0.035±0.034 


0.625 


Fe5015 


A 


34 


0.186±0.119 


0.769 


Mgi 


mag 


34 


0.017±0.002 


0.019 


Mg2 


mag 


34 


-0.005±0.003 


0.030 


Mgb 


A 


34 


-0.079±0.036 


0.473 


Fe5270 


A 


34 


-0.039±0.058 


0.420 


Fe5335 


A 


34 


0.063±0.049 


0.340 


SDSS Ca4227 


A 


141 


0.102±0.039 


0.468 


Fe4383 


A 


141 


0.261±0.096 


1.139 


Ca4455 


A 


141 


0.271±0.046 


0.543 


Fe4531 


A 


141 


0.231±0.092 


1.090 


Fe4668 


A 


141 


-0.046±0.115 


1.362 


H/3 


A 


141 


0.021±0.045 


0.540 


Fe5015 


A 


140 


0.601±0.078 


0.920 


Mgfe 


A 


136 


0.126±0.039 


0.455 


Fe5270 


A 


64 


0.220±0.066 


0.529 


Fe5335 


A 


102 


0.406±0.054 


0.547 


Fe5406 


A 


135 


0.208±0.037 


0.431 


Fe5709 


A 


129 


0.095±0.032 


0.366 


Fe5782 


A 


84 


0.021±0.031 


0.287 


Na5895 


A 


71 


0.083±0.052 


0.437 




468 10 123 346 

NFPS Fe4668 NFPS H(3 NFPS Fa5015 




NFPS Fe5a70 NFPS Fe5335 



Fig. 2. — NFPS linestrength s for s everal line indices plotted 
against those from IMoore et a 120021) . Solid Unes with a slope 
of unity indicate exact agreement between the two sets of mea- 
surements. All measurements are at Lick resolution. The average 
NFPS and Moore et al. linestrength error is shown in the upper 
left corner of each plot. 



suggests that, in most cases, both NFPS and Moore et 
al.'s errors are reasonable. 

The comparison with SDSS is shown in FigureO Note 
that for hnestrengths in the blue, the quoted SDSS er- 
rors are smaller, whereas for linestrengths in the red, 



the NFPS errors are typically smaller. There is evi- 
dence for small but significant offsets for most of the 
linestrengths, as indicated in Table 0] Furthermore, for 
most linestrengths, the reduced values are larger than 
unity, suggesting that the errors in either SDSS or NFPS 
(or both) are underestimated. Overall, however, based 
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on the consistency between our measurements and those 
of Moore et al. we conclude that our error estimates, 
which include systematic effects estimated from repeat 
observations, are realistic. 

3. NEBULAR EMISSION MEASUREMENTS 

While elliptical galaxy spectra are broadly character- 
ized as being dominated by absorption lines, it has long 
been realiz ed that many systems al s o show nebular emis- 
sion lines ("Herbie & Mavall' TdEf; ' Phillips et all lijM 
[Gonzalez 1993; Goudfrooii et al. 199^! 

A particular difficulty in studying integrated stellar 
populations is that nebular H/3 emission acts to 'fill in' 
the stellar absorption line, driving derived age measure- 
ments toward older values. Disentangling the absorption 
and emission components of H/3 is therefore vital if this 
linestrength is to be used to constrain star-formation his- 
tories. Since emission at H/3 is often co-incident with 
emission in [OIII AA4959, 5007A], a standard approach 
is to establish a correction based on the more easily- 
measured [O III] lines, usually assuming a constant cor- 
rection factor of H/3= 0.6x [O III A5007 A] l)Trager et all 
I20n0h() . In this section, we discuss an alternative method 
which aims to distinguish whether galaxies are likely af- 
fected by emission by fitting the spectra with absorption 
template models. 

3.1. Method 

Since galaxy spectra contain substantial structure from 
unresolved stellar absorption lines, weak [OIII] lines can 
be very difficult to distinguish without first removing 
the stellar component. This can be done by subtract- 
ing/dividing a 'matched' spectrum for similar galaxies of 
simila r stellar population p roperties but free from emis- 
sion ( Goudfrooii et alJl 994l or by subtracting/dividing a 
model stellar continuum ( Kuntschncr ct al. 2002) . In the 
latter case it is necessary to exclude the region around 
Mgb where current synthetic spectra (such as those of 
Vazdekis 1999) provide a poor match to observed ellip- 
tical galaxies due to non-solar abundance ratios. This 
limits the nebular lines which can be measured, exclud- 
ing such potentially-interesting features as NIA5199, but 
the method has the advantage of being easily automated 
when the redshift and velocity dispersion are known. 

Our measurements of H/3 and [O III] emission arc made 
on the spectra after dividing by the best-fitting Vazdekis 
()1999fl model, computed over the 4800 - 5100 A spec- 
tral range. The models span a range of metallicity 
-0.7 <[Fc/H]< +0.2 and age 1.0Gyr< t < 17.4 Gyr, 
and assume solar abundance ratios and power-law IMF 
with X = 1.3. The best-fit model is computed by com- 
paring the model and observed spectra after first shifting 
and broadening the model to match the redshift and ve- 
locity dispersion of the galaxy, as determined in Paper I. 
After division by the best-fitting model, a low-order con- 
tinuum is divided out to remove long- wavelength baseline 
variations. Equivalent widths are measured directly on 
the divided spectra (i.e., without assuming an emission- 
line profile), and errors estimated based on the noise in 
the 'line-free' regions. Figure 01 shows some illustrative 
examples of this continuum-removal method. 

It is clear that the model-fitting procedure itself can be 
biased by the presence of H/3 emission. Specifically, for 
galaxies with emission, a model with weaker stellar H/3 



can be fitted, yielding an underestimate of the nebular 
H/3. We have investigated this effect through simula- 
tions, adding emission lines to model population spectra 
of varying ages and velocity broadening. The recovered 
H/3 are biased low by 10-25%, depending on the underly- 
ing spectrum. In spectra of high velocity dispersion, the 
broad wings of the absorption spectra help to distinguish 
the narrow nebular emission lines. In older populations, 
the bias is somewhat reduced because there is little flexi- 
bility to push the fit to even older models. Thus the worst 
cases (where emission H/3 is underestimated by 25%) are 
for low-CT young stellar population spectra. Despite this 
bias, the wings of the H/3 feature are essential to separate 
the broad stellar and narrow nebular contributions. 

3.2. Results 

The above method was applied to 4964 galaxies with 
redshift and velocity dispersion data. (For this pur- 
pose we allowed the use of velocity dispersions from 
lower signal-to-noise spectra not reported in NFPS-I.) 
Of these, significant [OIIIA5007A] emission is detected 
in 589 galaxies, EW([OIII A4959 A]) in 154 galaxies, and 
EW(H/3) in 633 galaxies (all 3cr detection limits). 

The correlations between EW([0 HI A5007 A]), 
EW(H/3), and cr are presented in Figure El From 
Figure 5a it is clear that while the presence of OIII is 
indeed an indicator for H/3 emission, no single ratio 
between the lines is appropriate for all galaxies. In 
particular, there is a substantial population of objects 
with moderate to strong H/3 emission but with little or 
no [OIII]. Plotting the emission lines versus velocity 
dispersion (Figures 5b, c), shows that these objects are 
overwhelmingly of low mass (log a ^ 2.0). Thus for 
low-cr objects, the classical factor 0.6 under-corrects 
for emission contamination, while for high-a objects, 
the factor over-corrects. Note that the objects with 
large H/3 emission are those for which we expect in 
fact to underestimate the measurement, due to the bias 
described above. 

A better predictor of em ission at H/3 is the emission at 
Ha fGaldweU et"aLll2003D . As described in Section O 
HaF linestrength measurements are only available for a 
subset of ~700 galaxies, but these can provide a con- 
sistency check on H/3 emission, as shown in Figure |S| 
The HaF linestrength is constructed so as to be insen- 
sitive to the neighboring [Nil] lines. While HaF does of 
course have some contribution from stellar absorption, 
the ratio between the nebular emission lines is such that 
emission, when present, usually dominates at Ha, even if 
it is weak at H/3. The excellent correlation in the upper 
panel suggests that the emission H/3 measurements are 
indeed at least a reasonable proxy for Ha. The Ha/H/3 
ratio ranges from 2.9 to 5.9 (68% range), with a median 
of 4.5, similar to the typical range of 2-6 for early-type 
spirals l|Stasihska et al.ll2004D . 

3.3. Emission-line rejection criteria 

For galaxies with excess emission in the H/3 and 
OIIIA4959 and 5007A lines, the H/3 absorption line in- 
dex is contaminated and hence the derived stellar prop- 
erties are biased. As described above, a constant correc- 
tion factor is incorrect, even in a statistical sense, since 
the true ratios vary widely and systematically with ve- 
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Fig. 3. — Plots of NFPS linestrengths at instrumental resolution against SDSS linestrengths "scaled" to the same NFPS resolution. Solid 
lines with a slope of unity indicate exact agreement between the two sets of measurements. The average NFPS and SDSS linestrength error 
is shown in the upper left corner of each plot. 



locity dispersion. Therefore, instead of using the con- 
stant correction factor based only on [OIII], we prefer 
simply to exclude galaxies with emission at either H/3 or 
[OIII A5007A]. In Figure [7| we plot the emission mea- 
surements of up against [OIIIA5007A] and indicate the 
cuts made to exclude high-emission galaxies. Only galax- 
ies with EW(H/3) > -0.6 and EW(OIII) > -0.8 are in- 
cluded in our sample of all linestrengths used for analysis 
in the next two sections. 

4. DATA PRESENTATION AND SUMMARY 

Tables El and El present absorption and emission 
linestrengths for all galaxies observed, including those 
with emission. Hereafter, we refer to this as the ALL 
dataset. Table gives linestrengths and errors at Lick 
resolution for galaxies with S/N >15 per Angstrom, at 
5000-5500 A. The linestrengths in the table are corrected 
for velocity broadening, but have not been corrected for 
aperture effects. Additionally, the OIII and 11(3 emission 
equivalent widths with their common error are listed (in- 
dicated by W), along with the heliocentric redshift (cz©), 
velocity dispersion (cr), cluster identification and ratio 
of the angular diameter distances based on CMB-frame 
redshifts. Table |S1 presents the full-resolution data in 
analogous form. The auxiliary parameters (czq , etc) are 
duplicated here for convenience. 

The final galaxy sample will be culled for the analy- 
sis of the linestrength-(T trends and age and metallicity 
scaling relations in Sections and El 

5. LINESTRENGTH-CT RELATIONS 

The overall trends of linestrengths with velocity dis- 
persion reflect changes in the characteristic stellar pop- 



ulations as a function of galaxy mass. In this section 
we describe the linestrength-cr relations observed in the 
NFPS data and compare, where possible, to previous re- 
sults. 

5.1. Sample 

In the remainder of this paper, we will analyze a re- 
stricted sample of the NFPS galaxies, hereafter referred 
to as the CULL dataset. This is defined by applying the 
following cuts: 

1 . Galaxies must belong to one of the clusters in the 
NFPS cluster sample, as determined in NFPS-I. 
This rejects objects in the field and in background 
groups or clusters. 

2. Galaxies must be free of emission lines as deter- 
mined in this paper, having emission weaker than 
0.8 A in OIII5007 and weaker than 0.6 A in H/3. 

Moreover, in the CULL dataset, the velocity dispersions 
and linestrengths are corrected for aperture effects as de- 
scribed in Section 1^1 

5.2. Analysis of Individual Linestrengths with Velocity 
Dispersion 

We restrict our attention to the 20 linestrength indices 
which are least affected by systematic errors. In fitting 
our linestrength-(T data, data points more than 4(t away 
from the initial weighted least-squares fit are excluded. 
We then perform a linear regression of linestrength on 
logcr on the remaining data, allowing for measurement 
errors in both variables but assuming that any intrinsic 
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Fig. 4. — Some illustrative examples of the emission-line measurements. In the left-hand panels are shown the observed spectrum (heavy 
line) and the best-fit Vazdekis (1999) model (thin line). The ratio spectrum, from which the line measurements are derived, is shown in 
the corresponding right-hand panel. Vertical dotted-lines indicate the expected location of the H/3 and [O III] lines. At the top, a spectrum 
with no measurable emission; in the lower three panels, spectra with weaJf-to-moderate emission. Note the substantial range of H/3 to 
[O III] ratios among these examples. 
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Fig. 5. — Emission measurements at [OIIIA5007A] and K/B as a function of velocity dispersion, a. 
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TABLE 5 

Lick resolution linestrength indices and associated data 



Galaxy ID H(5f H(5a 

W(OIII) W(H/3) e(W) H7A Fc4383 

czQ logo- Fc5015 Mgb 
Cluster log(d/do) Fc5709 Fc5782 



CNi 
Ca4455 

Mgi 
Na5895 



CN2 
Fc4531 

Mg2 

TiOi 



Ca4227 
Fc4668 
Fc5270 
Ti02 



G4300 

H/3 
F05335 



H7F 
H/3+ 
Fc5406 
Hap 



NFPJ001006. 3-284623 

-0.06 -0.35 0.22 
18989 1.877 ±0.050 
A2734 0.080 



1.687 ± 0.748 -1.355 ± 1.495 

'4.244 ± 1.202 4.579 ±1.062 

4.472 ± 0.921 3.051 ± 0.442 
1.499 ±0.365 



0.0266 ± 0.0416 

0.674 ± 0.582 
0.0435 ± 0.0133 



0.0574 ± 0.0382 

3.540 ± 0.852 
0.1643 ± 0.0170 



0.016 ± 0.698 
4.268 ± 1.415 



6.174 ± 0.881 
2.068 ± 0.459 
2.884 ± 0.504 



-0.648 ± 0.673 
2.342 ± 0.348 
1.004 ± 0.411 



NFPJOOIOIO. 7-285020 

0.03 -0.13 0.14 
18626 2.041 ±0.039 
A2734 0.080 



0.580 ± 0.553 -1.020 ±1.001 

-2.944 ± 0.836 3.607 ± 0.760 

4.730 ±0.713 3.115 ±0.351 
0.505 ±0.239 



0.0192 ± 0.0306 

1.065 ± 0.336 
0.0600 ± 0.0117 



0.0446 ± 0.0345 

1.936 ± 0.593 
0.1609 ± 0.0151 



1.149 ± 0.421 
4.855 ± 0.966 



4.911 ± 0.675 
2.126 ± 0.334 
2.317 ± 0.380 



-0.361 ± 0.454 
2.341 ± 0.239 
1.687 ± 0.264 



NFPJ001024. 3-284935 

0.10 0.41 0.17 
17923 2.234 ±0.041 
A2734 0.080 



0.272 ±0.645 -1.753 ± 0.988 
-5.047 ±0.870 4.263 ±0.895 
5.946 ±0.764 3.599 ± 0.390 
0.998 ±0.279 



0.0553 ± 0.0267 

1.193 ± 0.443 
0.0563 ± 0.0119 



0.0812 ± 0.0330 

3.046 ± 0.622 
0.2086 ± 0.0156 



1.736 ± 0.448 
4.245 ± 1.072 



4.635 ± 0.766 
1.909 ± 0.363 
3.133 ± 0.471 



-1.225 ± 0.479 
2.030 ± 0.245 
1.768 ± 0.346 



NFPJ001032. 5-285154 

-0.17 -0.04 0.19 
17441 2.156 ±0.047 
A2734 0.080 



0.309 ± 0.781 -0.877 ±1.168 
-6.479 ± 1.041 5.265 ± 1.049 
5.538 ±0.898 4.494 ± 0.422 
1.051 ±0.284 



0.0538 ± 0.0326 

1.230 ± 0.439 
0.1003 ± 0.0129 



0.0830 ± 0.0421 

4.227 ± 0.723 
0.2501 ± 0.0168 



0.912 ± 0.502 
6.701 ± 1.138 



5.587 ± 0.799 
1.282 ± 0.383 
2.553 ± 0.476 



-1.787 ±0.581 
1.531 ± 0.316 
1.312 ± 0.316 



NFPJ001041. 8-283444 

-0.08 0.01 0.21 
18190 1.988 ±0.053 
A2734 0.080 



0.032 ± 1.019 -2.041 ± 1.502 

-5.915 ± 1.202 2.918 ±1.555 

4.396 ±0.987 2.885 ± 0.527 
0.436 ± 0.300 



0.0132 ± 0.0413 

0.692 ± 0.527 
0.0140 ± 0.0135 



0.0593 ± 0.0358 

3.183 ± 0.767 
0.1603 ± 0.0179 



1.862 ± 0.589 
3.673 ± 1.397 



4.676 ± 0.892 
2.344 ± 0.511 
2.059 ± 0.583 



-2.127 ±0.696 
2.435 ± 0.349 
1.595 ± 0.376 



NFPJ001042. 6-284916 -0.028 ± 0.723 -1.420 ± 1.070 0.0459 ± 0.0282 0.0712 ± 0.0332 1.368 ± 0.443 4.639 ± 0.756 -0.680 ± 0.515 
-0.60 -0.80 0.20 -3.697 ±0.933 4.181 ± 0.960 0.949 ± 0.415 2.020 ± 0.796 6.930 ± 1.133 1.633 ± 0.415 1.843 ± 0.309 
18682 1.942 ±0.047 4.511 ± 0.903 3.647 ± 0.454 0.0510 ± 0.0129 0.2026 ± 0.0168 •■■ 1.500 ± 0.472 1.817 ± 0.370 

A2734 0.080 0.760 ± 0.287 ■__ ■__ — ■__ ■__ ■__ 

'l'abicl5lis presented in its entirety in the electronic edition of the Astrophysical .lournal. A Dortion is shown here for guidance regarding its form 
and content. 



TABLE 6 

Full resolution linestrength indices and associated data 



Galaxy ID HSp H<5a 

W(OIII) W(H/3) e(W) H7A Fe4383 

CZ0 log cr Fe5015 Mgb 
Cluster log(d/do) Fc5709 Fc5782 



CNi 
Ca4455 

Mgi 
Na5895 



CN2 
Fc4531 

Mg2 

TiOi 



Ca4227 
Fc4668 
Fo5270 
Ti02 



G4300 

H/3 
F05335 
HaA 



H7F 
H/3+ 
Fc5406 
Hap 



NFPJ001006. 3-284623 

-0.06 -0.35 0.22 
18989 1.877 ±0.050 
A2734 0.080 



2.186 ± 0.977 -1.756 ±1.561 
-4.250 ± 1.221 5.697 ±1.330 
4.972 ± 1.033 3.213 ±0.475 
1.333 ±0.338 



0.0355 ± 0.0418 

0.885 ± 0.755 
0.0448 ± 0.0129 



0.0824 ± 0.0493 

3.795 ± 0.909 
0.1671 ± 0.0164 



0.141 ± 0.692 
4.022 ± 1.343 



7.007 ± 1.039 
2.189 ± 0.499 
3.212 ± 0.559 



-0.874 ± 0.703 
2.211 ± 0.338 
1.016 ± 0.434 



NFPJOOIOIO. 7-285020 

0.03 -0.13 0.14 
18626 2.041 ±0.039 
A2734 0.080 



0.703 ± 0.689 -1.443 ± 1.079 
-3.011 ±0.850 4.431 ±0.941 
5.426 ±0.822 3.179 ± 0.371 
0.546 ±0.274 



0.0275 ± 0.0306 

1.551 ± 0.479 
0.0609 ± 0.0110 



0.0515 ± 0.0366 

2.328 ± 0.701 
0.1617 ± 0.0143 



1.345 ± 0.498 
5.154 ± 1.025 



5.296 ± 0.783 
2.185 ± 0.361 
2.829 ± 0.456 



-0.446 ± 0.491 
2.322 ± 0.246 
2.094 ± 0.343 



NFPJQ01024. 3-284935 

0.10 0.41 0.17 
17923 2.234 ±0.041 
A2734 0.080 



0.243 ± 0.677 -2.122 ± 1.066 
-4.579 ±0.880 4.710 ± 1.004 
7.011 ±0.902 3.674 ±0.409 
1.067 ±0.310 



0.0744 ± 0.0312 

1.509 ± 0.555 
0.0589 ± 0.0114 



0.1053 ± 0.0367 

3.436 ± 0.706 
0.2105 ± 0.0149 



1.929 ± 0.507 
4.078 ± 1.048 



4.841 ± 0.850 
1.934 ± 0.385 
3.641 ± 0.543 



-1.280 ± 0.514 
2.055 ± 0.258 
1.911 ± 0.390 



NFPJQ01032. 5-285154 

-0.17 -0.04 0.19 
17441 2.156 ±0.047 
A2734 0.080 



0.490 ±0.809 -1.176 ±1.242 
-6.273 ± 1.036 5.713 ±1.152 
6.469 ± 1.051 4.668 ± 0.449 
1.237 ±0.344 



0.0668 ± 0.0372 

1.811 ± 0.637 
0.1047 ± 0.0125 



0.0923 ± 0.0440 

4.644 ± 0.800 
0.2541 ± 0.0161 



1.185 ± 0.646 
6.825 ± 1.166 



6.150 ± 0.924 
1.477 ± 0.457 
3.193 ± 0.589 



-1.862 ± 0.612 
1.438 ± 0.308 
1.727 ± 0.426 



NFPJQ01041. 8-283444 

-0.08 0.01 0.21 
18190 1.988 ±0.053 
A2734 0.080 



0.079 ± 1.049 -2.570 ±1.570 

-5.704 ± 1.222 2.755 ± 1.470 

5.083 ± 1.145 2.890 ±0.536 
0.539 ±0.383 



0.0279 ± 0.0415 

0.996 ± 0.747 
0.0154 ± 0.0136 



0.0929 ± 0.0489 

4.164 ± 0.983 
0.1604 ± 0.0173 



2.060 ± 0.657 
3.711 ± 1.406 



5.373 ± 1.063 
2.386 ± 0.532 
2.252 ± 0.637 



-2.085 ± 0.722 
2.425 ± 0.356 
1.890 ± 0.458 



NFPJQ01042. 6-284916 0.077 ± 0.758 -1.731 ± 1.139 0.0586 ± 0.0329 0.0961 ± 0.0382 1.571 ± 0.514 4.958 ± 0.858 -0.832 ± 0.546 
-0.60 -0.80 0.20 -3.647 ±0.947 4.662 ± 1.085 1.342 ± 0.577 2.329 ± 0.890 6.974 ± 1.150 1.751 ± 0.460 1.754 ± 0.304 
18682 1.942 ±0.047 4.843 ± 0.982 3.684 ± 0.469 0.0529 ± 0.0125 0.2069 ± 0.0163 •■■ 1.864 ± 0.580 2.101 ± 0.441 

A2734 0.080 0.869 ± 0.340 

'I'abiel^is presented in its entirety in the electronic edition of the Astrophysical Journal. A portion is shown here for guidance regarding its form 
and content. 



10 



-4^ 




Fig. 6. — Emissi on-lin e measurements, versus the HaF 
lincstrcngth (see Section 13.21 . Units are in A. 
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Fig. 7. — Equivalent widths of OIIIA5007 and H/3 emission lines. 
Solid lines are lines of constant emission. Points above the hori- 
zontal line and right of the vertical line are galaxies excluded from 
our CULL dataset. Units are in A. 



scatter comes solely from the linestrengths. Table[7|sum- 
marizes the parameters of our fits. Slopes and errors are 
quoted at both full (instrumental) and Lick resolution. 
In FigurelHl the linestrength-o- relations for 20 indices are 
shown, both for individual data points and for means in 
velocity dispersion bins. The solid curves show the fitted 
slopes. 

Of all the line indices, magnesium correlates most 
strongly with a and has been most widely used as an 
indicator of stellar populations. In order to compare our 
Mg-cr fit with those from other surveys, we convert our 
Mg6 line i n A to Mg6' in magn itudes using the conversion 
defined in lColless etal] ljl999|) : 



Mg&' = -2.5 log 
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Mg&\ 
32.5 / 



(3) 



very close to th at found by EFAR (0.13 ±G.017), SDSS 
(0.15±0.02) and lKuntschner et all ll20?il r0.1 42zfcn.013l. 
When the data from iKuntschner et alJ i 2001|) are fit us- 
ing our fitting method, the slope (0.113) is closer to ours. 
Our intrinsic scatt er of 0.011 mags i s lowe r than both 
those of EFAR and IKuntschner et alJ l|2001l) ; we do not 
compare our scatter to that of SDSS since they effectively 
reduce their scatter by creating composite spectra from 
individual galaxies. 

We take the logarithm of our H/3 linestrengths to com- 
pare the subsequent trends with a with other surveys: 
our H/3 fit yields log H/3 oc (-0.306 ± 0.008) log a for 
3440 galaxies compared to a slope of —0.24 ± 0.03 for 
SDSS. Jorgcnscn (1997) found log H/3 oc (-0.23 ± 0.08) 
log a with an intrinsic scatter of 0.061 compared to our 
intrinsic scatter of 0.041. We discuss the effects on our 
conclusions of a different H/3 slope in Section 16.31 

In order to compare our iron relations with other sur- 
veys, we defined <Fe> as 



< Fe > 



Fe5270 -fc Fe5335 



(4) 



For ^3400 galaxies, the Mgb' linestrength is fit with the 
relation A Mgb' oc (0.122±0.002) A(logcr). This slope is 



For 1090 galaxies with both Fe5270 and Fe5335 measure- 
ments, our best fit to log a is log <Fe>oc (0.088±0.008) 
logg. This is consistent with the lower value found by 
iJorgensen (1997) (0.075±0.025) and the higher value in 
the SDSS (0.11±0.03). 

The linestrength Fe5709 has a negative slope against 
log (7, and is the only iron line to behave in this manner. 
While it is possible that the line would be contaminated 
by strong sky emission from the Na IAA5683,5688 lines, 
this would only affect measurements in the extremely 
low-redshi ft clusters. W e have analyzed the linestrength 
data from iTrager et all l)1998D . and, for Fe5709, find a 
slope of -0.06±0.09, which is consistent with our result. 

We have shown that each of the 20 linestrengths con- 
sidered shows significant dependence on velocity disper- 
sion. The pattern which emerges is that all Balmer lines 
decrease with increasing velocity dispersion, whereas all 
other lines (with the exception of Fe5709) increase with 
increasing velocity dispersion. The following section de- 
scribes a robust method for interpreting these trends as 
scaling relations of stellar population properties, as a 
function of galaxy mass. 

6. GLOBAL AGE AND METALLICITY TRENDS 

Stellar pop ulation models, such as those by 
iThomas et all (j2003j) (hereafter TMB) and the exten- 
sion of these which includes H7 and US (jThomas et alJ 
l2004b(l . predict linestrengths from a range of stellar 
population parameters. These models are an extension 
of those by iMarastoni |199^ with adjustments based 
on theoretical stellar atmosphere calculations. TMB 
are currently the only models that predict linestrengths 
for populations with non-solar abundance ratios, as 
required for elliptical galaxy studies. TMB do not, 
however, include possible contributions from blue 
Horizontal Branch (BHB) stars, w hich might arise from 
a low -metallicity subpopulation ijMaraston fc ThomasI 
'2000'), or from enha nced mass-loss in evolved stars 
(Thor oas et al.l I2004aj) . Whatever their origin, such 
stars would contribute to a strengthening of the Balmer 
absorption lines, mimicking the effect of younger ages. 
In principle, BHB stars have greater impact on the 
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TABLE 7 

LiNESTRENGTH-CT FiTS 



Linestrength Units NQ^iaxies^ Mean Slope (Lick)'' Mean Slope (Inst)*^ 



HSa 


A 


3419 


-3.484 ± 0.109 


-3.381 


± 


0.116 


H(5p 


A 


3399 


-1.444 ± 0.055 


-1.408 


it 


0.065 


CNi 


mag 


3435 


0.197 ± 0.003 


0.193 


it 


0.004 


Ca4227 


A 


3420 


0.912 ± 0.039 


0.421 


it 


0.042 


H7A 


A 


3428 


-3.427 ± 0.096 


-2.997 


it 


0.099 


H7f 


A 


3434 


-2.163 ± 0.052 


-2.222 


it 


0.056 


Fe4383 


X 

A 


3437 


1.448 ± 0.084 


0.990 


it 


0.088 


Ca4455 


X 

A 


3457 


0.631 ± 0.033 


0.451 


it 


0.040 


Fe4531 


X 

A 


3442 


0.917 ± 0.048 


0.643 


it 


0.056 


Fe4668 


X 

A 


3443 


5.230 ± 0.109 


4.885 


it 


0.112 


H/3 


X 

A 


3450 


-1.171 ± 0.032 


-1.166 


it 


0.037 


Fe5015 


X 

A 


3450 


1.029 ± 0.068 


1.008 


it 


0.077 


Mgi 


mag 


2018 


0.121 ± 0.003 


0.123 


it 


0.002 


Mg2 


mag 


lyyo 


n 1 Qo -1- n nriQ 
U.ioy it U.UUo 


n 1 Q/1 
U.lo4 


1 

it 


U.UUo 


Mgfe 


A 


3414 


3.201 ± 0.041 


3.047 


it 


0.043 


Fe5270 


A 


1954 


0.620 ± 0.041 


0.614 


it 


0.045 


Fe5335 


A 


2113 


0.821 ± 0.045 


0.854 


it 


0.053 


Fe5406 


A 


2825 


0.432 ± 0.029 


0.432 


it 


0.033 


Fe5709 


A 


2503 


-0.131 ± 0.022 


-0.096 


it 


0.026 


Fe5782 


A 


2382 


0.172 ± 0.024 


0.289 


it 


0.027 


Na5895 


A 


1514 


4.276 ± 0.075 


4.590 


it 


0.084 


TiOi 


mag 


789 


0.021 ± 0.002 


0.021 


it 


0.002 


TiOz 


mag 


916 


0.046 ± 0.002 


0.046 


it 


0.003 


HaA 


A 


542 


-0.722 ± 0.148 


-0.791 


it 


0.067 


HoF 


A 


684 


-0.952 ± 0.104 


-1.147 


it 


0.099 



at Lick resolution. 
^ at full (instrumental) resolution. 



high-order hnes than on H/3 which could ultimately 
provide a means to disti nguish their effects fr om those 
of a younger population ijSchiavon et al.ll2'004^ . In this 
paper, we will assume that the BHB contribution is 
either negligible, or at least not strongly dependent on 
galaxy mass. 

6.1. Method 

We wish to use our linestrength data to determine ages, 
metallicities, and a-element enhancements as a function 
of galaxy mass. There are several ways to extract this 
information. The usual method, adopted by most pre- 
vious studies, is to derive these parameters for individ- 
ual galaxies by interpolating (sometimes extrapolating) 
the model grids. However, when the measurement er- 
rors in the linestrengths are non-negligible (as is the 
case with the NFPS data), the tilt of the model grid 
leads to cor related errors in age and metallicity for each 
galaxy ( Kuntschner et all 12001'). These correlated er- 
rors complicate the interpretation of the data, poten- 
tially generating a spurio us age-metallicity correlation 
l|Terlevich fc Forbesll200l . 

This problem can be overcome by stacking the spectra 
of similar galaxies, for exa mple galaxies within a narrow 
bin of velocity dispersion l)Bernardi et a]ll2003l) . to cre- 
ate a composite spectrum of high signal-to-noise ratio. 
Equivalently, one can average the linestrength measure- 
ments themselves for galaxies in each bin. In cither case, 
one averages over scatter in the linestrengths that may be 
due to measurement errors, but also averages the scatter 
due to intrinsic (possibly correlated) dispersion in the 



galaxy population parameters. The obvious weakness 
of this approach is that one can only state confidently 
the characteristics of the mean galaxy; thus, there is no 
information regarding the variation in parameters from 
galaxy to galaxy within the bin. 

As a preliminary step, we plot our binned linestrengths 
on top of grids derived from linestrength-parameter 
scaling relations from TMB in Figure |H| The quan- 
tity [MgFe]' in the lower two panels was defined by 
iThomas et all (|2003j) as 

[MgFe]' = VMg6(0.72 x Fe5270 + 0.28 x Fe5335). (5) 

This index was defined to be sensitive to overall metal- 
licity, but almost independent of [a/Fe]. In Figure|51 the 
area inside the dotted trapezoid represents the average 
error for a single linestrength measurement in each bin. 
Since each bin represents ~700 galaxies, the errors in the 
mean values are very small, and not plotted here. 

In the upper- left panel of Figure |51 where we plot mod- 
els with [a/Fe] = 0.0, our five points form a line of almost 
constant metallicity [Z/H] = 0.0 along with an increas- 
ing age estimate as the velocity dispersion increases. In 
the upper-right panel where [a/Fe] = 0.3, we can see 
a slight shift of our data points toward higher metallic- 
ity and again toward older ages, with increasing a. In 
the lower panels, the grids show predictions for constant 
age, while our five binned points follow a rough trend 
in metallicity - increasing [Z /H] with increasing velocity 
dispersion. There is a slightly less pronounced trend of 
increasing [a/Fe] with increasing a. It should be realized 
that, since each panel shows a two-parameter projection 
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Fig. 8. — Our linestrength-o- relations for each index. The linestrength and velocity dispersion for the galaxies (points) are sorted into five 
bins by increasing velocity dispersion (black boxes outlined in white). Solid lines are the slopes from the linear regression of linestrength 
on logo-. Dashed lines represent slopes predicted from our derived trends of [a/Fe], [Z/H], and age with velocity dispersion in Section l6l 
(see text in that section). 



of the three-parameter models, it is not trivial to read 
off the "correct" values from these plots. 

The stacking methods described above work well for 
a triplet of linestrengths (e.g. H/3, Mg6, <Fe>), from 
which one derives the triplet of galaxy parameters age, 
metallicity and alpha-enhancement. However, in gen- 
eral, each choice of linestrength triplet will lead to dif- 
ferent stellar population parameters. Ideally, we seek a 
method to combine the information from all linestrengths 
simultaneously. An obvious approach is to fit, via a 
method, all linestrengths as a function of the galaxy pa- 
rameters ijProctor et al.l2004aj^ . Unfortunately, the zero- 
points are typically uncertain, both for the model pre- 
dictions and for the observational measurements. As a 
result, the model grids often systematically under- or 
over-predict certain linestrengths. Inverting the grids 
then yields estimated ages (and other parameters) that 
are systematically in error. In a approach, the 
linestrengths which are affected in this way will be out- 
liers in the fit and will carry disproportionate weight in 
the results. 

Here, we choose instead to describe the whole sample 



simultaneously, in terms of a set of scaling relations, with 
log(age), [Z/H] and [a/Fe] each following a linear trend 
with logfT. Here [Z/H] is the total metallicity, expressed 
logarithmically relative to solar, while [a/Fe] is the a- 
element enhancement, similarly expressed. These global 
trends can be estimated by modeling their combined ef- 
fects on the observed linestrength-cr relation. We can 
find the parameter-cr relations from 

dl dl dP 

dloga ~ ^^dPd\og<T ^ ' 

where / is the linestrength (at Lick resolution), P is 
a parameter from the stellar population models (log 
age, [Z/H] or [a/Fe]), d//dlog(T are our linestrength- 
cr slopes from the previous section, and d//dP are the 
linestrength-parameter "responses" from the TMB mod- 
els. While the absolute zero-point of the grids is un- 
certain, the d//dP responses are more robust. We em- 
phasize that our dl / da relations involve the averaging of 
'-^^3000 linestrength measurements for each index; thus, 
statistical errors are suppressed and do not significantly 
influence the results of the regression. The aim of this 
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Fig. 9. — Model grids from stellar population models by IThomas et aTl 120031) . Each data point is the mean of a range of velocity 
dispersion, with each bin having approximately the same number of galaxies. The size of the points increases as a increases. The area 
inside the dotted line is the range covered by the average errors of the linestrengths. Linestrengths are in units of A. 



analysis is to constrain the dP/dlogcr representing the 
slope of the parameter scaling relations. 

This differential method has a number of advantages 
over the "grid inversion" approach. As already men- 
tioned, it avoids the problem of interpreting correlated 
errors in the derived parameters. Also it uses only 
the relative changes in the predicted linestrengths from 
the TMB models and the relative changes in our mea- 
sured linestrengths with velocity dispersion. Our method 
is thus explicitly insensitive to calibration uncertain- 
ties in the models and to overall zero-point errors in 
the linestrength measurements. Finally, unlike stacking 
spectra or linestrength measurements by velocity disper- 
sion, there is no need to bin the data into arbitrarily- 
defined subsets. The main shortcomings of the method 
are its assumptions that the model grids are parallel over 
the parameter-space spanned by our sample and that the 
parameter-log (T relations are indeed linear. 

The outputs of the differential method are the 
parameter-cr scaling relations, which depend on the 
choice of the input model responses d//dP. To estimate 
d//dP, we calculate A//AP from the model grids them- 



selves by differencing an upper (or lower) value with a 
fiducial value. For example, our data span 3-15 Gyr in 
age, with a central age of 8 Gyr. Referring to Fig El for 
a metallicity of [Z/H] = 0.0 and [a/Fe]=0.0, we see that 
AH/3pred = -0.288 between 5 and 10 Gyr. Note that the 
grids are very nearly parallel. Thus, if we had assumed 
[Z/H] = 0.35 instead, we would have obtained a very 
similar value, AH/^pred = —0.294. To obtain the best 
estimates of the model responses, we need to make rea- 
sonable choices for the central values of the three parame- 
ters, and for their upper and lower values. We adopt as a 
central value the parameters from our fits corresponding 
to the median velocity dispersion (log a ^ 2.1): [a/Fe] 
= 0.2, age — 8 Gyr, and [Z/H] the average of 0.0 and 
0.35. The range for each parameter was restricted to 
the range spanned by the sample galaxies, i.e. age: 3-15 
Gyr, [a/Fe]: 0.0-0.5, and [Z/H]: -0.33-0.67. Because the 
grids are not exactly parallel, there is a small uncertainty 
in our derived parameter scaling relations introduced by 
our choice of central values. We will discuss this in more 
detail in Section ing] 
To fit three model parameters, we need at least three 
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linestrength-cr relations. We can over-constrain the fit 
by including more than three indices, and in practice we 
use the following 12 line indices: CNi, H/3, H7F, HSp, 
Fe4531, Fe4668, Mg&, Fe5270, Fe5335, Fe5406, Fe5709, 
and Fe5782. This set includes three age-sensitive Balmer 
lines, two indices quite sensitive to a-enhancement (Mg&, 
CNi) and a range of mainly metallicity-sensitive features. 
We did not include lines such as Fe4383 and Fe5015 where 
the systematic correction to the linestrength is a signifi- 
cant factor of the random error (see Sect ion . Ca4227 
was excluded because in ge neral the Ca abundances are 
not very well understood l|Saglia et al. 200^, and we 
avoid including partially redundant indices, for instance 
H7A. Alternative choices of indices are possible and the 
selection of which to include can affect the fit results by 
more than the formal random errors. We discuss this 
issue further in Section ing] 

The d//dlogcr slopes we use from our data derive from 
the regression of linestrength on logcr, as in Table [3 
To weight contributions from the different line indices 
in the fit, we use the formal regression errors on the 
linestrength— CT slopes. No contribution from model error 
is included. 

6.2. Results 

When all 12 line indices are used in the fit, we obtain 
a/Fe (X cr°-3i, 2/H ex a°-^^ and age oc a°-^^. The for- 
mal error estimates are very small (0.01, 0.02 and 0.02, 
respectively), and, as we discuss in greater detail below, 
are not representative of the true uncertainties. For more 
insight into which linestrengths drive the fit results, we 
have explored restricted models in which one or more of 
the parameters is held constant with a. In Figure [TUl we 
show the predicted versus observed linestrength-a slopes 
for the 12 line indices when different combinations of 
the stellar parameters are allowed to vary with a. We 
normalize both the "actual slope" (i.e., our measured 
linestrength-cr slope) and the predicted slope by divid- 
ing by the error in our linestrength-cr slopes. Thus, the 
diagonal line in each panel shows where the actual and 
predicted slopes agree. Any vertical displacement of a 
linestrength indicates an under- or over-prediction of the 
observed slope. 

Figure 10a shows good agreement between the actual 
and predicted linestrength-cr slopes when [a/Fe], [Z/H], 
and age all vary with cr, i.e. our default solution. When 
age is forced to be constant with cr, as in Figure 10b, 
the predicted (negative) slopes of the Balmer lines are 
too shallow and the predicted (positive) slopes of the a- 
sensitive indices, Mgb and CNi, are also too low. Thus, 
without an age-cr relation, we cannot simultaneously re- 
produce both the Balmer lines and the a-sensitive slopes. 

We also see that if metallicity is held constant with 
cr, as in Figure lOd, the predicted slopes of the Balmer 
lines are too steep, as is that of Mg6. Similar arguments 
apply to the other restricted fits. In the case where only 
[a/Fe] is allowed to vary with cr (Figure lOg), the fit fails 
catastrophically, since a correctly-predicted Mg6 slope 
would require [a/Fe] increasing with cr, while a decreas- 
ing [a/Fe] with cr would be needed to produce the ob- 
served negative Balmer line slopes. The result is a very 
poor fit with no [a/Fe] trend at all. 

Using the derived parameter-cr relations, we can re- 
derive our linestrength-cr fits and check the consistency 



with our data-derived linestrength-cr slopes. We used 
our default solution for the parameter-cr relations (a/Fe 
oc cr°-^^, Z/H cx a°-^^, and age cx cr°-^^) to generate the 
predicted slopes (dashed lines) in Figure |H1 For each 
linestrength, the predicted slope is very similar to the 
measured slope with the exception of Fe5709, whose mea- 
sured linestrength-cr slope is opposite to those of the 
other iron line indices. 

In Section we noted t h at th e H/3 slopes found by 
both SDSS and 'JorgensenI ljl997(l are 20% lower than 
ours. We have repeated the analysis using their slope 
but found no significant change in our results, including 
the age-mass gradient. 

6.3. Error Estimation 

The formal errors on the galaxy parameter slopes 
dP/d(logcr3 are very small: specifically, we find a/Fe 
cx aO-3i±o oi, Z/H oc a0-53±0-02, age oc a0-59±0 02. From 
a purely statistical point of view, these errors may 
be slightly underestimated for two reasons: first, the 
linestrength measurements are not strictly independent 
since in a few cases the bandpasses overlap (e.g., Fe5270 
and Fe5335), and second, it is possible that the scatter in 
intrinsic parameters from galaxy-to-galaxy may be cor- 
related (e.g., Trager et al. 2000b suggests that at a given 
cr, younger galaxies are also more metal-rich). Neither 
of these effects will bias the resulting age, metallicity 
and alpha-enhancement trends, which are essentially the 
means of the galaxy parameters at a fixed sigma. How- 
ever, we will show below that systematic effects vastly 
dominate over the statistical errors, so will use the for- 
mer as our error estimate and neglect the latter. There 
are two sources of systematic error: one is the choice 
of linestrengths indices used in the regression; the other 
is due to the uncertainty in estimating the responses 
dl/dP. 

To estimate the systematic errors due to choice of 
linestrength indices, we divide our 12 line indices into 
three groups: Balmer lines (H/3, H7F, HSp), a-sensitive 
indices (Mg&, CNi) and the seven Fe-dominated indices. 
We then examine the scaling relations that result when 
only one index from a given group is used, and all in- 
dices from the other two groups are used. Using Fig- 
ure as a guide, we can draw several conclusions. First 
the only consistently "outlying" line indices are Fe5335 
and Fe5782, which prefer a weaker age trend, a stronger 
metallicity trend, and a weaker [a/Fe] trend, and Fe5709, 
for which the reverse is true. In general, there is little 
spread of the scaling relations; choosing other combina- 
tions of linestrength indices yield consistent results. In 
particular, note that the age gradient is essentially un- 
changed whichever Balmer line is used. Note also that 
the scatter along the age-metallicity degeneracy line is 
such that if the age trend is steeper by 0.1 the metallic- 
ity trend is flatter by '~0.03. We estimate the resulting 
errors on our scaling relations by calculating the stan- 
dard deviation of the results from all of the linestrength 
combinations. We find the deviations to be 0.03 in the 
[a/Fe]-logcr relation, 0.06 in [Z/H]-logcr, and 0.08 in age- 
logcr. 

Our second error component is the deviation of our 
parameter-CT scaling relations when different sets of 
d//dP values were used in the regression. We consider 
eight cases in which are central values of [a/Fe], [Z/H], 
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Fig. 10. — A comparison of the predicted index-cr slopes (vertical axis) with the observed slopes (horizontal axis) for different combi- 
nations of the three stellar population parameters. Labels indicate the results for each of the 12 lincstrcngth indices. The diagonal line 
indicates perfect agreement between model-predicted and measured slopes. Note that both model-predicted slopes and measured slopes 
are normalized by the formal error in the measured slope. Thus the vertical offset from the line indicates the degree to which the model 
fails to predict the observed slope. Panel (a) shows our default solution in which [o/Fe], [Z/H] and log(age) are scaled with logcr. The 
model predictions for Fe5709 deviate from the observed slope at a high significance level (compare with Fig 8). Note, however that Fe5709 
is discrepant in all panels. Panels (b)-(d) show the comparisons when we assume no trend with u for age, [a/Fe] and [Z/H] respectively. 
In these cases, there are multiple indices which deviate strongly from the model predictions. For example, if we assume no age gradient 
(panel b), wc cannot simultaneously reproduce the slopes of Mg6 and CNi and the Balmer lines. The fits are considerably poorer when 
only one parameter is allowed to vary, as in panels (e)-(g). 



and age are perturbed by to;i, ±0.35 and ±2 Gyr respec- 
tively, and the responses d//dP accordingly recalculated 
around these central values. We find a standard devia- 
tion of 0.07 in the [a/Fe]-logcr slope, 0.08 in [Z/H]-loga, 
and 0.10 in log(age)-logcr. When these errors are added 



in quadrature with the errors from different linestrength 
combinations, tlic total error is 0.06 in [a/FeJ-logcr, 0.08 
in [Z/H]-logc7, and 0.13 in log(age)-logcr. 

In summary, we see that the inclusion or exclusion of 
individual indices and the choice of input d//dP values 
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Fig. 11. — The effect of inclusion of certain linestrengths on our regression. Each label or open circle represents the inclusion of that 
element and the exclusion of the other elements in that group (see text). For example, the Fe5709 label indicates the fit results when Fe5709 
is the only Fe-dominated index included (all of the Balmer and a-sensitive indices are retained for this test, however). Similarly, the CNi 
label indicates that for that fit, CNi was the only a— index included (i.e., Mg6 was excluded), but all of the Balmer and Fe indices were 
also included. For clarity, only indices which have a significant effect on the default solution are labeled; other indices are shown as open 
circles. The straight line in the left panel is drawn by eye to represent the linearity of the age-metallicity degeneracy. The rectangle in each 
panel outlines our default solution with error range. 
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Fig. 12. — Our range of stellar population parameter scaling relations with cr plotted along with those from other groups. (CA03 = our 
estimate from Figure 21 and Table 9 from Caldwell et al. (2003); TRAOO = ITrager ct al. 12000a); TH04 = Thomas ct al. 1200®). Open 
circles represent the range in NFPS sca ling relations when certain linestrengths are included from the fit while the rest in their group have 
been excluded (see text and Figure [TTl . 



affect the results by considerably more than the formal 
errors. The scatter between various results allows esti- 
mation of more realistic uncertainties. Our final results, 
with errors, are: a/Fe oc cr°-3i±o.06^ Z/H cx cr°-53±o.08^ 
and age oc (jO-59±o i3^ These errors are correlated in the 
sense that a stronger age trend corresponds to a weaker 
metallicity trend and vice-versa. 

6.4. Grid inversion: a consistency check 

The slopes analysis of the previous section makes the 
assumption that the grids are linear and parallel (i.e. that 
a single set of responses dl/dP is sufficient to capture the 
structure of the models). Also, we imposed a linear re- 
lationship of each population parameter with log a. We 
have argued that despite these restrictions, this explic- 
itly differential method is more robust against calibra- 
tion uncertainties in both the data and the models, than 



direct interpolation of model grids, and has the further 
advantage that it does not require explicit binning. 

In this section, we return to the grid interpolation 
method as a consistency test, and to assess the limita- 
tions of the slopes analysis. Since the individual measure- 
ments are subject to sizable random errors, we use the 
linestrengths averaged over the five velocity dispersion 
bins of Figure El The model grid is inverted by deter- 
mining, for each velocity dispersion bin, the parameters 
(log age, [Z/H], [a/Fe]) which best reproduce (accord- 
ing to a statistic) the same twelve line indices used 
above. This analysis yields an estimate of the three pop- 
ulation parameters as a function of log a which is not 
forced to the linear form imposed in the slopes method. 
The results are shown in Figure 1131 The grid-inversion 
approach yields age-cr, [Z/H]-(t and [a/Fe]-a relations in 
excellent agreement with results from our slopes method. 
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Fig. 13. — Fundamental scaling relations of the population pa- 
rameters with a. The thick line shows average age, metallicity and 
[a/Fe] determined in bins of velocity dispersion, using the grid- 
inversion method. The set of line indices employed is the same as 
that in the slopes method. For comparison, the linear relations 
obtained from the slopes method are overplotted as the thin red 
line, with zero-point adjusted to match the grid-inversion results. 
The light-shaded re gions indicate the maximum internal scatter 
estimated in Section l6.5l 



At face value, the age of the most massive galaxies is 
^llGyr, but for the reasons emphasized elsewhere, the 
absolute values of age, [Z/H] and [a/Fe] are less secure 
than the relative change along the mass sequence. The 
upper panel of Figure 1131 suggests a non-linearity in the 
age-logcr relation, which appears to steepen by almost 
a factor of two at the lowest masses (i.e., low a). Such 
behavior is, by definition, not observed in the slopes anal- 
ysis of the previous sections. 



6.5. Internal population scatter 

An issue of considerable interest is the degree of in- 
ternal scatter among galaxy properties at a given point 
on the mass sequence. For instance, the tightness of the 
color-magnitude relation has been used to i nfer limits on 
the s pread in formation ages of ellipticals ijBower et alJ 
Il992j) . The scatter around linestrength-a relations in 
principle provides very powerful constraints, since each 
line index has different sensitivities to the underlying 
population parameters. 

While an upper limit to the scatter in linestrength-cr 
relations is readily available from the measured scatter, 
estimating the intrinsic scatter requires accurate knowl- 
edge of the experimental errors, and a robust estimation 
method. In each bin, we first fit a linear linestrength- 
cr trend (allowing the slope and zero-point to vary from 
bin to bin if the data require this). Then we model the 
linestrength residuals assuming a constant intrinsic scat- 
ter for that bin S{I), in addition to the measurement 
errors (which include the systematic components esti- 
mated in Section [2. 4|l . In this way we can determine the 
range of iS'(/) values which can reproduce the observed 
inter-quartile range of the data. Relative to a standard 
maximum likelihood estimate this is more robust against 
outliers, and focuses on matching the core of the distri- 
bution. 

The 95% confidence limits on S{I) are plotted in Fig- 
ure for a subset of our line indices, for the five velocity 
dispersion bins used elsewhere in this paper. The line in- 
dices shown are those in which intrinsic scatter is most 
cleanly detected: Mgfe, II7F, Fe5406 and Fe4668. 

Given a set of models such as TMB, the scatter S{I) in 
each line index yields an upper and lower bound on the 
internal age scatter 5 (log age), and similarly for [Z/H] 
and [a/Fe]. Using multiple line indices, these constraints 
can be compounded, to leave a narrow range in each pa- 
rameter which is consistent with the observed scatter. In 
this way, we have determined the internal dispersion con- 
sistent with the observed scatters shown in Figure^] To 
simplify the problem, we have assumed that the parame- 
ter scatters are independent, i.e. within each bin there is 
no internal correlation between age-cr and metallicity-cr 
residuals. In converting parameter scatter to predicted 
linestrength scatters, we use model responses appropri- 
ate to each bin, given the grid inversion results above. 
Note that we do not use II/3 here; significant intrinsic 
scatter in 11/3 is observed, at a level inconsistent with the 
other line indices under any model for the underlying 
distribution. This is likely due to variations in low-level 
residual nebular emission. 

The results of these calculations are included in Ta- 
ble IHl along with the results of the grid inversion anal- 
ysis. For each bin, we tabulate the range and average 
of cr, the average stellar population parameters, and the 
range of population scatter consistent with the limits in 
Figure 1X41 The parameter scatters should be used with 
caution, since they are quite sensitive to the accuracy of 
our observational error estimates. Taken at face value, 
however, the analysis suggests a ^25 % scatter in metal- 
licity at given cr, while the age scatter appears to vary 
with mass, increasing from ~15% at high cr to ~40% for 
the low-cr bin. 

In the next section, we discuss our results from this 
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TABLE 8 

Age, metallicity and [o/Fe] by ct-bin 



Range of u 


(log cr) 


{Age) 


([Z/H]> 


([a/Fo]> 5(log{Agc)) 


S([Z/H]) 


S{[a/Fe]) 


28 - 107 


1.935 


4.57 


0.05 


0.13 


0.05 - 0.15 


0.07 - 0.12 


< 0.13 


107 - 135 


2.088 


6.87 


0.11 


0.18 


0.08 - 0.19 


0.06 - 0.10 


< 0.10 


135 - 162 


2.173 


7.93 


0.15 


0.20 


< 0.11 


0.07 - 0.11 


< 0.08 


162 - 200 


2.257 


9.31 


0.18 


0.24 


< 0.08 


0.06 - 0.10 


< 0.09 


200 - 434 


2.386 


10.82 


0.24 


0.28 


0.03 - 0.08 


0.04 - 0.08 


< 0.07 



section in comparison with those of other studies, and 
consider the relation of our results with recent observa- 
tions of intermediate-redshift clusters. 

7. DISCUSSION 
7.1. Interpretation of the scaling relations 

In the following sections, we compare our results to 
other similar studies and to observations at higher red- 
shift. First, however, it is useful to reiterate some caveats 
to interpreting the scaling relations derived here. 

As in any survey, the sample selection criteria must 
be borne in mind when discussing our results. The 
NFPS sample analyzed here includes only cluster mem- 
bers, based on the criteria of NFPS-I. Within each clus- 
ter, the galaxies were selected by apparent magnitude (to 
R = 17) and color. Compared to some previous surveys, 
our sample probes to fairly low mass, a ~ 50 km s~^. 
The nonlinearity suggested by Figure ^] is such that 
samples of more massive objects would yield shallower 
age trends than studies covering the low-cr regime. The 
color criterion rejects blue galaxies further than 0.2 mag 
from a red sequence fit to each cluster. While such a cut 
should exclude actively star- forming galaxies, there is no 
explicit selection on morphological type. Thus our sam- 
ple includes many SO galaxies and some bulge-dominated 
spirals; such galaxies were likely excluded (on a subjec- 
tive basis) from many studies of "bona- fide" ellipticals. 

To test for differences in the scaling relations be- 
tween morp hological classes, we have used GIM2D 
l)Simard et al. 2002) to derive a bulge-to-total light ratios 
for about half of the galaxies in our sample. We see indi- 
cations that "diskier" galaxies (i.e., those with B/T<0.5) 
follow a steeper age-cr relation. However, there remains a 
significant age-cr trend for the bulge-dominated galaxies. 
In a future paper, when final morphological information 
is available for all of the galaxies, we will address this 
issue in much greater detail. 

An additional element in our selection process is the 
rejection, from our CULL sample, of galaxies with neb- 
ular emission lines. This likely removes preferentially the 
later-type objects from our analysis, but given that the 
emission selection depends in part on H/S, it is impor- 
tant to test for any bias this introduces. To investigate 
the effects of emission selection, we have repeated the 
analysis including galaxies with emission. In this case, 
we find that the linestrength-cr slopes for some of the 
metal lines (e.g., Mg6) are on average steeper than the 
default solution and thus our emission selection results 
in a shallower metallicity gradient with a. However, the 
Balmer linestrength-cr relations also steepen, suggesting 
that emission galaxies harbor young underlying popu- 
lations not totally disguised by emission in-filling. As a 



result, including the emission galaxies yields an age trend 
even stronger than in our default soluti on. Other surveys 
corrected their galaxies for emission iK untschner et all 
l200lt iTrage r et al. 2Q00a) usi ng the [OIIIA5007A] cor- 
rection factor of iTrager et al.l (^OOb) but still included 
them in their analysis of the stellar population trends. 
We have shown (Section |3I) that such a scheme typically 
under-corrects at low a and over-corrects at high a. The 
net effect of this will be to flatten the age-a relation de- 
rived in these studies. 

The scaling relations for age, metallicity and [a/Fe] 
were determined from central spectra. The fiber spec- 
tra sample a physical radius 0.2-0.8 /i~^kpc, depending 
on the distance to each cluster, and linestrength mea- 
surements were aperture-corrected to a common physi- 
cal radius of 0.68 /i~^kpc assuming universal gradients. 
Although the linestrength-a relations and the color- 
magnitude relation are often considered as reflecting the 
same underlying trends, this assumes, perhaps naively, 
that there is a trivial relationship between central stellar 
populations and the global colors. 

A final set of caveats concerns the stellar population 
models used to translate the linestrength— cr slopes into 
scaling relations of population parameters. For sim- 
plicity, these models describe the highly idealized case 
of a single-age, single-metallicity population. In ellip- 
tical galaxies, there may instead be a bro ad distribu- 
tion of metallicities fHarris & Harris^ "200^) . Similarly 
there could be sub-populations of differing ages, as in the 
"frosting" models of ITrager et al.l H200 0al. Strictly, our 
analysis determines scaling relations of the luminosity- 
weighted mean stellar age, metallicity and [a/Fe]. An 
alternative to a true age trend, therefore, would be a sce- 
nario in which all galaxies have a 1 Gyr "frost" represent- 
ing a progressively larger mass-fraction at lower velocity 
dispersion. A separate concern is that the models, as 
described above, do not include the effects of Blue Hor- 
izontal Branch (BHB) stars as observed in metal-poor 
globular clusters. Note, however, that the evidence for 
an age-trend is strongest in the case of H/3 than for HSp 
and H7F. Qualitatively, this is opposite to the signa- 
ture expected from v ariations in the BHB contribution 
(jSchiavon et alJl2'004j) . Robust constraints on this effect 
must await more sophisticated models which simultane- 
ously incorporate BHB stars and a-element abundances. 

7.2. Comparison to other results on scaling relations 

In this section, we compare our results with those from 
other groups, taking into account the important differ- 
ences in sample selection and emission treatment as de- 
scribed above. 

We have already noted that our results, particularly 
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Fig. 14. — Total, measurement and intrinsic scatter around the 
linestrength-(T relationships. In each panel, the black lines show the 
observed (i.e., total) scatter in A (strictly, this is a robust estimate 
of the la scatter, based on the measured interquartile range). The 
95% confidence intervals for the intrinsic la scatter are indicated 
by the shaded boxes. 



the age-CT scaling relation, are not sensitive to which 
Balmer line (H(5f, H7F, H/3) is used. This conclusion 
is reinforced in a separate paper, {Srnith 2005), which 
shows that a strong age gradient is also required to re- 
p roduce the slope of th e NFPS HaA — f relation. 
iTraeer et al.l l)2000bj) used principal component anal- 



ysis to investigate correlations of age, metallicity, and 
[E/Fe] (analogous to [a/Fe]) with structural parameters 
in their sample of early-type galaxies in clusters and in 
the field. Although they model the scaling relations in a 
different way, fitting metallicity and [E/Fc] to age and ve- 
locity dispersion simultaneously, they find E/Fe oc cr'^-^'^, 
very similar to our value of a/Fe oc u^''^^. Their metallic- 
ity gradient is dependent on the individual ages of their 
galaxies, but the velocity dispersion component is Z/H 
oc g" ''^ com pared to our value of Z/H oc a'^-^^. Although 
iTrager et a l. (2000b) do not claim a trend of age with cr, 
a simple fit to their data yields an age-cr trend with an 
exponent of 0.6 ± 0.2, consistent with our results. 

iKmitschncr et al. ( 2001), using linestrengths for a sam- 
ple drawn mostly from clusters, explicitly assume no age 
trend with cr, which strongly affects the other scaling re- 
lations derived. They find a higher A[Z/H]/Alog(T slope 
(~0.9) after correcting for varying [a/Fe]. Their higher 
metallicity slope and lack of an age slope is likely the ef- 
fect of the age-metallicity degeneracy, which we also see 
in the spread of points in the leftmost panel of FigurelTTl 
Modeling the lKuntschner et al. (2001) data according to 
our method, we obtain age oc a~'^'^ if all three parame- 
ters are allowed to vary. Although the relation is steeper 
than our default solution, it is quite sensitive to a single 
outlying low-cr galaxy. Excluding this outlier (which has 
l ater-type morphology) yields age oc (t~'''^. 

iPoggianti et al.l l)200H) obtained age and metallicity es- 
timates of ~280 red galaxies in the Coma Cluster. They 
found a broad range in age at all magnitudes, making 
it difficult to quantify the mean age-a relation, but they 
did note that the fraction of young dwarf galaxies in their 
sample is higher tha n the fraction of young giant galax- 
ies. Furthermore, in IPoggianti et aP l)2004() . the authors 
noted that post-starburst k-l-a spectra were identified 
in dwarf galaxy spectra, with luminosities L <^ O.IL*. 
These results indicate a "down-sizing" effect in that the 
most recent star formation activity occurs at lower red- 
shifts for progressively fainter galaxies. This effect will 
be discussed further in the next section. 

'Caldwell et al. (2003) derived ages and metallicities for 
their sample of 175 early- type galaxies in clusters and in 
the field, including many with cr < 100 km s'^ In Fig- 
ure 21 of their paper, they plot age versus cr, and note a 
strong correlation. Surprisingly, however, these authors 
do not quote a numerical estimate for the slope. From 
their Table 9 and Figure 21, we estimate their log(age)- 
log cr gradient to be 0.8-1.2 depending on the index com- 
bination employed. In addition, they find a shallower 
trend of metallicity than we do (0.32 versus our value of 
0.53). Their age gradient is steeper than ours, but their 
results lie on the age-metallicity degeneracy line for these 
t wo parameters. 

iThomas et al.l l|2004aD quote ages, metallicities and a- 
enhancements for 54 early-type galaxies in high-density 
environments. They do not quote scaling relations for the 
sample as a whole, but rather break the data into sub- 
classes pre-selected by age and velocity dispersion. In or- 
der to compare their results with ours, we have analyzed 
their published data for early-types in high-density envi- 
ronments and derive the following scaling relations from 
a simple unweighted regression on log a: 0.78 ± 0.23, 
0.42 ± 0.14 and 0.36 ± 0.05 for age, metallicity and a- 
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enhancement respectively. These results are in excellent 
agreement with our results, but their errors are large be- 
cause their sam ple contains few low-a galaxies. 

We note that. lProctor et all l|2004bj) also found a pos- 
itive age-cr gradient in galaxies in Hickson Compact 
Groups. They do not quote an age-cr relation but exam- 
ination of their Figure 5 suggests a relation somewhat 
steeper (~ 1.2) than our best fit value. 

Figure II 2 | summari zes our results a nd t hose of 
Trageretal. ( 200*0^) ICaTdwell et all l)2003j) and 
Thomas et al.. ( 2004a). We conclude that our scal- 
ing relations of age, metallicity and [a/Fe] fall within 
the range spanned by previous studies. Differences 
among the above results may arise in part from the 
different choices of linestrengths used (resulting in 
scatter along the age-metallicity degeneracy ellipse), 
and from different sample selection characteristics. 

7.3. Connection to observations at intermediate redshift 

The reality of our steep trend in age as a function of 
mass, along the red-sequence, can be tested with obser- 
vations at high redshift. As previously noted, the abso- 
lute age calibration of the models is less secure than the 
relative ages. If we identify the stellar age of the most 
massive systems (ct ~ 400 km s~^; ^ 20 L*) with 13 
Gyr, close to the age of the Universe, then the least mas- 
sive red-sequence galaxies, with a ~ 60km s~^ (0.01 L*) 
have ages of approximately 4 Gyr, and galaxies with 
a K, 100km s^^ (~ 0.1 L*) would have an ages ~5.5 Gyr. 
Taking these at face value, and considering also the sub- 
stantial spread in age at given a (especially for low-cr 
objects), we would expect to observe strong evolution- 
ary effects at intermediate redshifts. At an earlier epoch, 
some of the stellar mass presently residing on the faint 
end of the red sequence will not have been formed at all; 
some will be present in star- forming galaxies; and only a 
fraction will exist on the red sequence itself. A key obser- 
vational signature of such evolution would be a depletion 
or truncation of the red sequence, affecting progressively 
more massive galaxies with increasing lookback time. 

A number of studies have in fact advanced evidence 
for such a depletion. For example, iSmail etldl l|1998fl 
observed 10 clusters at z « 0.24, corresponding to a 
lookback time of 3.2 Gyr (for a concordance cosmology). 
Examination of their Figure 5 suggests a strong decline 
in the number of red-sequence galaxies at 2.0 — 2.5 mag- 
nitudes below M* . This corresponds to approximately 
tr ~ 100 km s~^. Our age— cr relation indicates current 
ages of 3-8 Gyr for such objects, with a mean of ~ 5 Gyr. 
Thus a substantial fraction of such galaxies either had 
not yet become quiescent at z ~ 0.24, or else did not have 
enough time to age onto the red sequence. We conclude 
that the truncation of the Small et al. red sequence is at 
least approximately consistent with the age— tr relation 
obtained from NFPS. At a somewhat higher redshift of 
z K. 0.75 (lookback time of 7 Gyr), Do Lucia et al (2004) 
examined the red sequence in clusters and found a defi- 
ciency of a factor of two (compared to Coma) for galaxies 
with O.IL* < L < OAL*. Our age-cr trend and scatter 
predicts that in this luminosity range (corresponding to 
cr ss 100 — 150 km s^^), the present-day stellar age is 
7 Gyr on average. Thus we would expect ^ 50% of such 
galaxies to be significantly bluer at those epochs, again 
approximately consistent with the observed depletion. 



For a ~1 Gyr period after star formation ceases, galax- 
ies pass through a post-starburst (or E-l-A) phase, char- 
acterized by very str ong Balmer absorption but no strong 
emission. iTran et "al . ( 2003) studied E-f A post-starburst 
galaxies in three clusters at z—0.3, 0.6, and 0.8. They 
find that the typical velocity dispersion of post-starburst 
galaxies at these redshifts decreases from ^^170 km s~^ 
at z = 0.8 to ~100 km s~^ at z = 0.3. If we identify 
these objects as galaxies in the process of fading onto 
the red sequence, then this trend of more massive galax- 
ies becoming quiescent at higher redshifts fits well with 
our present-day age-cr trend. 

In summary, our results add to recent evidence that 
the red sequence of cluster galaxies has built up gradu- 
ally over cosmic history, progressing from more massive 
to less massive galaxies. Such a scenario is the cluster 
analogue to the "down-sizing" of the charac teristic mass 
of star-forming field galaxies as discussed bv lCowie et alJ 
(1996) and Kauffmann et al. (20Q1). 

8. CONCLUSIONS 

In this paper, we have presented absorption 
linestrength measurements for ~5000 red-sequence 
galaxies in low-redshift clusters. Our survey sam- 
ples galaxies with velocity dispersion ranging down to 
~50km s^^. The absorption data are complemented 
with emission line measurements which can be used to se- 
lect subsamples with only low levels of nebular contami- 
nation. We have employed the slopes of the linestrength- 
velocity dispersion relations to constrain linear scaling 
relations for stellar population parameters. We find 
that more massive galaxies are older, have higher overall 
metallicity, and have higher [of/Fe] ratios than galaxies 
of lower mass (strictly, this refers to central, luminosity- 
averaged properties of the stellar populations). These 
conclusions are quite robust, and in particular are not 
dependent on which of the age-sensitive Balmer lines are 
used in the analysis. Moreover, a more traditional model- 
grid inversion yields consistent scaling relations. 

The most important result of this paper is the very 
clear detection of an apparent age-mass relation for red 
sequence galaxies in clusters. This arises because a 
flat age-cr relation cannot generate the steep slopes of 
the Balmer linestrength-cr relations while simultaneously 
matching the metal linestrength-cr slopes. This is consis- 
tent with the results from a number of other independent 
studies based on smaller samples. 

A strong age-mass relation stands in stark contrast 
to the widespread assumption that cluster ellipticals 
form a metallicity sequence of approximately constant 
age. Moreover, our apparently anti-hierarchical age- 
mass relation disagrees with the predictions from semi- 
analytic galaxy formation models, which suggest either 
that bright er ellipticals have slightly y ounger stellar pop- 
ulations ( Kauffmann & Chariot "1998*) or that early- type 
cluster galaxies should have uniformly old stellar popula- 
tions (Fig. 18 of Kuntschner ct al. (2002), which is based 
upon the models of iGole et al. (2000 )). 

Our results are broadly consistent with claims of a 
truncated or depleted red sequence in clusters at higher 
redshifts (e.g.. Small et al. 1998; de Lucia et al. 2004), 
and of an increase in the average masses of post-sta rburst 
cluste r members with increasing redshift (Tran et alJ 
|2003). Together, these observations suggest a trend of 
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"down-sizing" galaxy formation in clusters, mirroring a 
similar declin e in the characteristic mass of star-formin g; 
field galaxies ijCowie et alJl996HKauffmann et alJl2004D . 
These studies all present a picture of the red sequence 
in clusters building up slowly over cosmic history, pro- 
ceeding from the most massive to progressively lower 
mass galaxies. In particular, galaxies presently on the 
faint end of the red sequence became quiescent only at 
very recent epochs, and are likely the descendants of 
star-forming or post-starburst galaxies in intermediate- 
redshift clusters. 
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